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SEQUENCE LISTING 

(DCEKEBALDIFOBlUnOill: 



OAFFUGUrr. Evans, Ronald IL 
No,Darid 




PRODVCISRQiTEDIHGREn 



(iiDKIBIBERIIFSGIlimmiS 

Qv)CORRESrONDmADDR^: 

(A) ADDRESSES: Cra; Gary Ware iiFrcidciirich LLP 

(B) 436S becBlive Drite, Suile ICOO 
(DCmr: San Diego 

a))STA1E:CA 

(DOOUriEKllSA 

(nZIP:92121-2IS9 

(f)COHPim READABLE FORll 
' (A)HEDniH'nTE:Floppjdisk 
(B)COHPiniS:]BUPC compatible 
(0 OPERATING SISTEH: PC-DOS/US-DOS 
aDSOFraARKFalenUiReleas #1.1^ Venion #1.25 

(tOCDRBENTAPPUanOIIDATA: 
(AlAPPUCinONMIIIBER: 
(B)nUKGDAIE. 
(OOASSnGATnill: 

(ii» PRIOR APPUCmON DATA: 

tt)APPUCI1I0NI!liHBER: OS/974,S30 
(B)FIUNG DATE: Noiemlier 19, 1997 
(OGLAmCAHOIIt 

(iilOATT0RM/AtB«TINF0RlUK 
tt)NAnE:Reiter, Stephen E 
WREGISIRAnOIINUHBER: 31,192 
(ORffQIEKGE/DOasrinnfllQL SAU(IS20-2 

(iiOmECOlDIDIIIIGAnOIIIINFORIUnOlll: 
tt)im3'flOIIE: 619-677-1409 
ODTQjrAX: 619-677-1465 



(2)IKF0SIU110jliF0RSEQIDN0:l: 

(i)SEQM(X(»ARAnmiS11CS: 
(A)LBIiGTB:7laniii»acids 
(BITITE: amino acid 
(OSmumOINESSrsimle 
WTOPOUGKMh 



(iDllOLECIIUnT&prateiD 
(?)nUGIIBITTliTE:iiiienial 



(li) SEAUEKCE DESCRIP1I0III: SEQ ID KO:i: 

Xaa Xaa Cp Xaa Xaa Atp Xaa Ala Xaa Glj Xaa 1)t Xaa Xaa Xu 
IS 10 IS 

Xaa Gfi Xaa Xaa C{s Xaa riK r be Xaa Arg Xaa Xaa Xaa Xaa Xaa 

. 20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Cjfi Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cp 
S 40 45 

Xaa Xaa Xaa Lji Xaa Xaa Arg Xaa Xaa Xaa Xaa G)fs Ai{ Xaa Xaa 
50 55 60 

LjiQsXaaXaaXaaGljHet 
65 70 

GOMFORUAIIOIIIFOBSEQIDIinhS: 

(DSGQUEMSdUUPIfMIKS: 

(A) lAm 5 amino adds 
WnrKamiiioacId 
(0S1IIA]a)B)IIESS:A|gle 

(B) TOPOLOGY:iiiikoinm 

GDHOUCilUnrfipepGde 
MFBAGHEVrnTE: internal 



(d) SEtUEinZ DESCEIPinil: m ID m 

GlnGlfC^IjsGlT 
1 S 

(2)IlliniRHAiniliniRS01IDKOJ: 

OSEWlEHHZOAIAnQimiGS: 
a)LENG18:5andnoacldi 
(B) TYPE: amino add 
(OSmUfDEDKESS: angle 
(D)TOPOlOCr:nnbioini 

aOHOUCDUTlfFKpepOde 



WniACMBn'Tli'rE: internal 



58 



(A) mma desquphok seq id ho:S: 

GljSerCril^Val 
I 5 

(2)DiF0IIIU11IIIIF0RSEItIDK(l:4: 

(OSGQllEHCXIllRACIISIISinS: 
U)LEK6TH: 2241 base pain 
(BjTlTEDUcletcacid 

(osmmmmboui 

a))T0r0UI6V:boUl 
(ii)UOLEClilSmcDIU 



(ii)FE4niR£ 
U)IUHE/KEY:(3IS 
(B)lOGinOIII:L224l 

aUOIHERniFtlBlUTIOK/proiliiGis "rgEtR" 



(li) SE(lliEK(£ DESOUPnON: SEQ D) AM 

ATCGCCCCCCCGAGCeATGTCAGCCTGGGGGACGAGrrCGlCTniUG tt 
M JUa Fro Pro Hit Aq) Val Ser Lea Glj Asp Cla Leu His Leo Asp 
1 S 10 IS 

66CGAGGACGTGGCGATGGC6GATGGCGA0GG6GTA6ACGAT1TC6AT 96 
GlfGluAipValAbHelAbHisAlaAqiilaLenAspAspPheAsp 
20 25 30 

CIC GACATG TTG GGG GAG GGG GATTCCGCG GGTCCG GGA ITTACCCCC 144 
Leo Asp Hel Leo Glj Asp Gij A^ Ser Pro Gl; Fro Gl; Phe nir Pro 
35 40 45 

aCGACTECGCCOGCTACGGCGCrCTGCATAICGCCIiACmGAGTIT 192 
ffiiAipSerAlarniyrCljAlaLeaAspHetAlaAspPlKGiaPh! 
50 55 60 

GAGCAGATGmACCGATGOCCrrGGAATrGACGAGTACGGTGGGlAG 240 
GiD CId He( Fhe nr Asp Ala Leo Gpe Asp Glo 1^ Glj Glj L]s 
65 70 75 SO 

CrrCTACGTAGCTITAGAAGGATATIXAATIVrATATCTTCAGGTCGC 2U 
Lei Leu ((r Hit Ser Arf Arg De Ser Asn Ser He Ser Set Glj Ar( 

i5 90 95 

GATGATCrCTCGOCrTCGAGCAGCITGAACGGATACTCGGCGAACGAA 336 
AspAspLeoSerPnSerSerSerLeDAsaGljT^TSerAlaAsnGlo 
100 105 110 



AIXTGCGATGOGAAGAAGAGCAAGAAGGGAOCrGCGOCtCGCGTGGAA 384 
Ser CfS Asp Ala Lj8 Ljs Scr Lji L]^ Glj Fro Ala Pro Arg Val Gin 
115 120 125 



GAG GAG CTG TGCCrC CTTTCC GGC GAC AGG GCCTCC GGC TAG CACTAC 432 
Gla Glu Leo Leu Val Cyis 61; Asp Arg Ala Scr Glj T)t His 1)t 
130 I3S 140 

AACCCCCTCACCTGICCA TCCTCCAAG CTG TTCTTrCGA CCCAGCGTT 4110 
AsD Ala Leo Ibr Crs eij Ser Cfs L}8 Val r he Phe Arj Arg Ser Val 
145 150 155 160 

ACG AAC AGG GGC GTC TAG TCC TCC AAG TO GGG GGC GGC TCC GAA ATC 528 
Hit Lj8 Ser Ala Yal 1^ C;ib Ljs Plic Gi; Arj Ala C]|S Gla Uct 
165 170 175 

GAC ATC TAG ATG AGG OGA AAG TGT GAG GAG TCC GGC (TTG AAA AAG TCC 576 
Asp lie! Tp* Hel Arg Arg Ljs Cfs Gla Glo Cp Arg Leu Ljs L}^ 
110 Its 190 

CTG GGC GTG GGT ATG GGG GGG GAA TGC GTC GTC GGG GAG AAG GAA TGT 624 
Leo Ala Yal Glj Het Arg Pro Gla Cjs Val Val Pro Glu Asn Gin (;p 
195 200 205 

GGG ATG AAC GGG GGC GAA AAG AAG GOG GAG AAG GAG AAG GAC AAA ATG 672 
Ala HeC Ljs Arg Arg Gta Ljs Ljs Ala Gin Ljs Glo Ljs Asp Ljs Met 
210 215 220 

ACCACrTCCCCGAGCTCTCAGCATGCCGGCAATGGCAGCraCCCTirr 720 
llir Tbr Ser Pro Ser Ser Glo His Glj Glj Asn Glj Ser Lea Ala Ser 
225 230 235 210 

GGT GGC GGG GAA GACHTGIT AAG AAG GAG ATT GTT GAC CTT ATG AGA 768 
GljGlfGljGlnAspPbeValLjsLjsGlDlieLeaAspLeoHetTtir 
2(5 250 255 

TGC GAG CCG CCC GAG CAT GCC ACT ATT CCG CTA GTA CCT GAT GAA ATA 316 
Cfs Glo Pro Pro Glo His Ala Thr He Pro Leo Leo Pro Asp Gla He 
260 265 270 

m GOCAAG TGTCAA GGG GGC AAT ATA GCTTCGTTA ACG TAG AATCAG 364 
Leo Ala Ljs Cfs Glo Ala Arg Asn He Pro Ser Leo llir IJt Asn Gin 
275 230 235 

ra GCC Gn ATA TAG AAC Ttk ATT TCC TAG CtG GATGCC TAT GAG UG 912 
Leo Ala Val He Tjr Ljs Leo He Trp Gin Asp Glj Ijr Glo Gin 
290 295 300 

OGATCTGAAGAGGATPrCAGGCGTATAATGAGTCAACCCGATGAGAAC 960 
Pro Ser Glo Glo Asp Leo Arg Arg Be llcl Ser Glo Pro Asp Glo Asn 
305 310 315 320 

GAG AGG GAA ACG GAC GTC AGG TIT CGG CAT ATA ACCGAG ATA ACCATA 1003 
Glo Ser Glo llir Asp Val Ser Phe Arg His He Thr Glu He Hir He 
325 330 335 

PrCACGGTCUGTrcATrGTrGAGrrrGGTAAAGGTCrACCAGOGTrr 1056 
Leo Tlir Val Glo Leo He Val Gin Phe Ala Ljs Glj Lea Pro Ala Phe 
340 345 350 



ACi JUG ATA CCC CAG GAG GAC GAG ATC ACG TTA CTA AAG GCC TGC TOG 1 104 
Hit L}s De Pro Glo Glo Asp GId lie Hit Leu Leo Ljs Ala Qs Set 
3S5 360 36S 

TCG GAG GTG ATG ATG CT6 CGT ATG GCA CGA GGC TAT GAC GAC AGC TCG I IS2 
SerGlo Val HelHelLeoAisHet Ala Atj AifljrJUp HisScr Ser 
370 37S 3S0 

GACTCAATATItTrCGCGAATAATACATCATATACCCCCGATTCTTAC 1200 
Asp Ser De Phe Fhe Ala AsD Asa Arg Scr 1)t Hit Arg Asp Scr i;ifr 
3SS 390 395 400' 

AAAATGCCCGGAATCGCrGATAACAnGAAGACCrGCFGCATITCTGC lUi 
Lji Hel Ala GljHet Ala Asp Asn lie Gla Asp Leo Lei Rlsrhe (jis 
40S 410 41$ 

CGC CAA ATG TTC TCG ATG AAG GTG GAC AAC GTC GAA TAC GCG CTT CTC 1296 
ArgGlnHelPheSo'lIedjiValAvAsnValGlaTiTAIaLeaLea . 
420 425 430 

ACrGCCAHGTC ATCTTCTCG GAC CGG CCG CGCCTG GAGAAG GCCGAA 1344 
Hit Ala De Val He Phe Ser Asp Arg Pro Glj Leu Gli Ala Gin 
435 440 445 

(TA GTCGAA GCG ATCCAG AGCTACTACATCGACACG CTA CGCAHTAT 1392 
Leo Val Gin Ala De GId Ser 1^ Tjfr lie Asp llir Leu Arg lie I^T 
450 455 460 

ATA CTC AAC GGC CAC TGC GGC GAC TGA ATG AGC CTC GTC TTC TAC GCA 1440 
De Leo Asa Arg His GIj Asp Ser Het Ser Leo Val Phe lyr Ala 
465 470 m 4S0 

AAGCTGPrCTCGATCCTCACCGAGCTGCGTACGCrrGGGCAACCAGAAC I4U 
Leo Leo Ser De Leu Hit Giu Lea Aig Hit Leu Gl; Aso Gin Asn 
4SS 490 495 

GCCGAGATGTGTTrCTGAmAAGCrCAAAAACOGCAAACrGCCCAAG 1536 
Ala Glu Met l!|i Pbe Ser Len Lji Leu Asa Arg Ijs Leu Pro L}i 
500 SOS 510 

ITCCTCGAGGAGATCTGGGACGTrCATGGCATCOCGCCATCGGTCGAG IS34 
Phe Leu Gin Glu De Itp Asp Val His Aia De Fro Pro Scr Val Gin 
515 520 525 

TCG CAC CIT CAG ATP AGC GAG GAG GAG AAC GAG CGT CTC GAG CGG GGP 1632 
Ser His Leu Gin De Hit Gin Gin Gin Asa Glu Arg LeoGlnAif Ala 
530 535 540 

GAG CGT ATG CGC GCA TCGGHGGG GGCGCCATrACCGCCGGCATrGAT 1630 

GluArgHetArgAlaSerValGljGljrAlanellirAlaGlflleAsp 

545 550 555 560 

TGCGACTiTGCCTCCACTTCGGCGGGGGCAGCCGCGGCCCAGCATCAG 1723 
lilsAspSerAlaSernrSerAlaAlaAlaAlaAlaAlaGlnHisGIn 
565 570 575 



Pro Gin Pro CIn Pro Gin Pro Gin Pro Ser Ser Leu llir Gin Asn Asp 
5S0 585 590 

TDCCACaCCACACAaGCKaGCrACUan'GtGCnOClCCTCAG 1824 
Ser Gin HU GId Hit CIn Pro Gin Leo Gin Pro Gin Leo Pro Pro Gta 
595 600 605 

CTGCAAGGTCUCTGCAACCCCAGCrCClAGCAUGnTCAGACGGAI 1(72 
ku Glo Glj GiD Leu Gin Pro Glo Leo Gin Pro Gin Leo Gin llir CIn 
610 615 620 

ncGACfuciGAncuccACAcn^CAGnciTrcixGicTixce^ 1920 

Leo Glo Pro Gin De Gin Pro Glo Pro Glo Lea Leo Pro Val Ser Ah 
625 630 635 610 

CKCTCaXGCCTCCGTAACCGCACCTGCTTCCrniTCCGCGGTCAGT 1963 
Pro Val Pro Ala Ser Val nr Ala Pro Glf Ser Leo Ser Ala Val Ser 
615 650 655 

ACG AGG ACCGAA TACATG GGCGGA AGTGCG GCCATA GGA GCG ATC ACG 2016 
11irSerSerGlDl)THelGlj.GlrSerilaAlalleGlrProlle11ir 
660 665 670 

CCG GGA ACC ACC AGC ACT ATC ACG GIT GCC CTTACC G(T AGC TCC AOC 2061 
Pro Ala Hit nir Ser Ser lie Hit Ala Ala Val Hit Ala Ser Ser nir 
675 630 635 

ACA TCA GOG GTA CCG ATG GGC AAC GGA GTT GGA GTC GGT GTT GG6 GT6 21 12 
IbSer Ala Val Pro HelGlj AnCI; Val GIf Val GIf Val G|f Val 
690 695 700 

GGC GGC AAC GTC AGC ATG TAT GCG AAC GCC GAG ACG GCG ATG GCC TIG 2160 
Glj Glj Asn Val Ser Het lyr Ala Aso Ala Gin nir Ala He( Ala Leo 
705 710 715 720 

ATG GGT GTA GCC CTG GITTOG GAC GU GAG CAG CTf ATC GGG GGA GTG 2203 
HetGljValAbLenHiiSerlitGlnGliGliiUoIleGlfGlifVal 
725 730 . 735 ' 

GCGGnAAGTCGGAGCACTCGACGACTGCATAG 2241 
Ala Val L}s Ser Glu His Ser Hit nir Ala 
740 745 



(2)nF0RIU1I0HF0RSl»lDN0:5: 

G) SEQUENCE CHARAnBUSnCS: 

(A) miH: 746 amino addi 

(B) TyP£-aflrinoaGl(l 
(D)T0P0L0GV:littear 

(H)HOI£CDI£nTK protein 



(li) SEQUEKOE DESOUPTION: SEQ ID KO;S: 



HetliaProProlbrAspyalSerLeDGIjAspGlaLeaHisLeuAip 
I 5 10 IS 

Gl; 61a Asp Val Ala Net Ala His Ala Asp Ala Leo Asp Asp Pbe Asp 
20 25 30 

iM Aq) llet Leo Gij Asp Glj Asp Ser Pro Glj Pro GljPhc Hit Pro 
3S 40 ^5 

His Aap Ser Ala Pro Gl; Ala Leu Asp llet Ala Asp Phe Glo Phe 
50 SS 60 

Glu Gin Hel Phe llir Asp Ala Lea 6|j He Asp GId T|t GIj GI| Ljs 
65 70 n SO 

LeuLenGljllirSerArgArtneScrAsoSerlleSerScrGljArg 
S5 90 95 

Atp Asp Lea Ser Pro Ser SerScr LeuAsn GljI^rSer Ala Asn Glo 
100 105 liO 

Ser(^AqiAlaIjsI^SerLjsIjiGI]rProAl8ProAisValGlo 
IIS 120 I2S 

Gin ClB Leu Gp Leu Val 61; Asp Ai^ Ala Ser CI; 1)t His lyr 
130 135 140 

Asa Ala Leu llr Cfi Gl; Ser Cp Lji Val Phe Phe Arg Arg Ser Val 
145 ISO 155 160 

Hv Ijs Ser Ala Val 1^ Cji Cfs Ljs Phe Glj Arg Ala Qs Glu Hel 
165 170 175 

Asp Net 1)t Met Arg Arg Ljs (llfs Gla Glu C)[s Arg Leu Ljs Ljs Clfs 
ISO 185 190 

UuAbVal G(rHetAr{Pro GlD^ Val Val PnGluAsn Gli(!|i 
195 200 205 

Ala Hel Ljs Arg Atg Glu Ljs Lys Ala Glo Ljs Gla A^ LjsHet 
210 215 220 

llirllirSerProSerSerGlnHisGljG^AsnGljSerLeaAlaSer 
225 230 235 . 240 

61; G|; 61; 6hi Asp Phe Val L;s Ljs Glu lie Uu Asp Leu llet nr 
245 230 2S3 

iji 6lD tn Pro 6ln His Ala llir De Pro Leu Leu Pro Asp Gin He 
260 265 270 

Leu Ala L;8 C}i Gin Ala Arg Asn He Pro Ser Leu llir 1)t Asa Gin 
275 2S0 2S5 

Leu Ab Val He Ijrr L;s Leu He IVp Ip Gin Asp Gl; T)t Gin Gin 
290 295- 30O 



Pro Ser Gin do Asp Leu Ais Arg lie Met Ser Glo Pro Asp Gin Asn 
305 310 3IS 320 

GiD Ser Glo Hit Asp Val Ser Phe ktg M He Hit G|g lie Hit De 
325 330 335 

Ud nr Val Gin Lea He Val Gla Fhe Ala Ljs Glj Lea Pro Ala Phe 
340 345 350 

llirl^OeProGlnGluAqiGlnllellirLeDLeaLjsAlaCpSer 
3SS 360 365 

SerGlaValHelHetLeoArglletAlaArgAifl^AsplIisSerSer 
370 375 3S0 

Asp Ser lie Pte Pbe Ala Asn Asn Aif Scr T)fr Thr Arg Asp Scr 1)r 
m 390 393 400 

LjsHetAlaGlrHetAlaAspAsnlleGlnAspLeuLcuHisPiieCrs 
405 410 415 

Arg Gla Hel Flie Ser Hel Lji Val Asp Asn Val Giu 1^ Ala Leu Leu 
420 425 430 

HirAlaDeVaineFteSerAspArgProGljfUuGluLjsAIaGIn 
435 440 445 

ku Val Glu lb Ue Gin Ser l^r Tp De Asp Hit Leu Arg lie 1)t 
450 455 460 

De Lei An Aig His Ijis G|]i Asp Ser Ser Lei Val Phe Tir Ala 
465 470 475 430 

Iji Leu Leo Ser De Len Tlir Gla Leu Arg llir Len GIf Asn Gb Asn 
4S5 490 495 

AiaGliHeiljlsPbeSerLeil^LealjiAsnArgLjiLeaPralji 
500 505 510 

Phe Lei Gin Glu He ftf Asp Val His Ala De Pro Pro Ser Val Gin 
515 520 525 

Ser His Leu Glo De Thr Glo Glu Glu Aso Glu Arg Leu Glu Arg Ala 
530 535 540 

GloArgHelArgAbSerValGljGljAlaDe'nirAlaGljDeAsp 
545 550 555 560 

(^AspSerAbSernrSerAbAhAbAbAbAbGtaHisGta 
565 S7D 575 

PnGbProGtaProGloProGbProSerSerLeollirGbAsnAsp 
510 5S5 590 

Ser Glo His Gin Hit Gin Pro Glu Leu Gin Pro GInLeu Pro Pro Glo 



995 600 60S 

Lea Gla Glj Gin Leu Glo Pro Gin Leo Gin Fro Gin Len Gin llir Gin 
610 6IS 620 

In Gin Pro Gin lie Gin Pro Gin Fro Gin Len Len Pro Tal Scr Ala 
625 630 635 610 

Pro Val Pro Ala Ser Val Ihr Ala Pro Glj Ser Leo Scr Ala Val Scr 
m 650 655 

Hit Ser Ser Gin Tjr Net GljGljf Ser AlaAla lie Gl; Pro Dellir 
660 665 670 

Pro Ala % nir Ser Ser lie Tbr Ala Ala Val Hit Ala Scr Ser Hir 
675 6S0 6S5 

nvSer Ala Val Pro HclGljAsn Glj Val Glj Val G|j Val Glj Val 
690 695 700 

GfjCljAai Val SerHe(1)TAIaAtn AlaClnThr Ala HetAlaLen 
7IH 710 . 715 720 

' Ucl Glj Va! Ala Leu His Ser His Glo Glu Gin Len lie Glj Glj Val 
725 730 735 

Ala Val Lji Ser Clu His Ser Hit Hir Ala 
740 745 



(2)IKF0RH4'n0NF0RSEQn)N0:6: 

(USEQIKNOEGEyiRAGIBtlSniS: 

(A) lA'GTfl: 2211 base pain 

(B) 'nTE:nudeicacid 
(OSIfiANDEDIKbolli 
(D)nFOLOCr:bolb 

QUHOimEnTKcDIU 



WIUHE/KEl-OIS 
(B)U)CA'nON:L2241 

(D) OIHER IKFORlUnOlt /prodiii;l= I'pEcR" 



di) SEQUEKCE DBSOIIPIIOIII: SGQ D M 

AT6 GOC COC COG ACC GAT GTC A6C CTG CGG GAC GAG CTC GAG TTA GAG 
Met Ala Pro Pro Hit Asp Val Ser Leo Glj Asp Gin Leo His Len Asp 
15 10 15 

6GC GAG GAC GT6 GCG AT6 GCG CAT GOC GAC GCG OTA GAC GAT TTC GAT 
GljGloAspValAblleiAlaHlsAlaAspAlaLeoAspAspPbeAsp 
20 25 30 



CTG 6AC ATG TTG GGG GAG GGG GAT TCC GCG GGT CC6 GGA TIT ACC CCC 144 
l«n AtplIelLeo Glf Glj AspSer FraGljPn GljPhe nirPro 
35 40 4S 

CAC CAC TCC CCC CCC TAC CGC CCT CTG CAT ATG CCC GAC TTC GAG TIT 192 
HisA^Ser AlaPro Tjt GljAlaLeu AspUclAla Asp Plic GluFbc 
SO SS 60 

GAG GAG ATG TFT ACC GAT GGC (TT GGA ATT GAC GAG TAC GGT GGG AAG M 

ClnGlaHetPkeT1irAspAlaLeu6ljIleA8pGlBT|T6l]fGljl^ 

CS 70 75 80 

m m GGT ACC TCT ACA AGG ATA TCG AAT TCT ATA TCr TCA GGT GGC iU 
Leo Leu Gl; Hit Ser Arg Arg lie Ser Ata Ser De Ser Scr Glj Arj 
SS 90 95 

CATGATCrCTCGCerTCGAGCAGCTTGAACGGATACTCGGCGAACGAA 336 
lip Asp Leo Ser Pro Set Ser Ser Lea Asn Glj 1)|r Ser Ala An Glo 
100 105 110 

A6CTCCGAT GCG AAG AAG AGGAAG AAG GGA CCTGCG CCl CGG GTG GAA 384 
Ser A^ Ala Ljs Ljs Ser Ijt hp Gl; Pro Ala Pro Arg Val Gin 
lis 120 125 

SAG GAG CTG TGC CTG GTT TGC GGC GAC AGG GCG TCC CGC TAC CAC TAC 432 
GloGlD Leo CysLeu Val tji GljAsp ArgAlaSer GljlJfrHii^r 
130 135 140 

AACGCCCTCACCTGTGAGGGCTGCAAG GGG TTC TIT CCA CGC AGC GTT 4S0 

AtnAlaLeaTlirCpGlnGijCrsLjsGljPhePlieAriArgSerVal 

145 ISO 155 160 

ACGAAGAGGtOCGTCTACTGCTGCAAGTrCGCGOGCGCCTGCGAAATG 528 
llr Ljs Ser Ala Val T]T Gp Cp Lj8 Pbe Glj Arg Ala GId Het 
165 170 1^ 

GAC ATG TAC ATG AGG GGA AAG TGT GAG GAG TCC CGCPTG AAA AAG TGC 576. 
Aip M fjt He( Arg Arg Ljs C|fs Glo Glu Cfi Arg Lea Ljfi Ljs Pus 
180 185 190 

CrGGOCGTGGGTAieGGGGCGtAATGCGTCGTCCOGGAGAACCUIGT 624 
Uo Ab Val GIf Het Arg Pro Gla Val Val Pro Glo An Glo 
195 200 205 

GOG ATG AAG CGG GGC GAA AAG AAG GCC CAC AAG GAG AAG GAC AAA ATG 672 
Ala Hel L}i Arg Arg Glo Ljs Ijt Ala Gli Ljs Glo Ijt Asp Ljs lie! 
210 215 220 

AGCACrTCGGCGAGCKTGIGCATCGCGGCAATGGCAGCTniGCCTCr 720 
nr nr Ser Pro Ser Ser Gin His Glj Glj Aso Gij Ser Leu AlaSer 
225 230 235 240 

GGTGGCGGCCAAGACTrrCnAAGAAGGAGATrCrrGACCrrATGACA 768 
Glj Glj Glj Gin Asp Phe Val Ljs Ljs Glo Ue Uo Asp Leo Het Hir 
245 250 255 



66 



TGCIUGCCGCCCUGCATeCCACTATrCCGCrAn'ACCTGATGAAATA SI6 
GId Pro Pro Gin His Ala llir lie Pro Lea Leo Pro Asp Clo lie 
2S0 26S 270 . 

TTGGCCAAGfGTCAAGCGCGCAATATACCTTCCnAACGTACAATCAG «M 
Lea Ala Ljs Cjfs Gin Ala Arg Atn lie Pro Ser Leu Thr T)t Aso Gin 
273 m 2S 

TTG GCC Cn ATA TAG AAG HA AIT TGG TAG GAG GAT GGC TAT GAG GAG 912 
Lea Ala Yal De fjr lj» Lea He Itf tjt Gin itffifjtp Gla Gin 
290 29S 300 

CCA TCP GAA GAG GAT CTC AGG CGT ATA ATG ACT CAA CCC GAT GAG AAC 900 
Fro Ser Gin Glu Asp Lea Atg Arg He Met Ser Gin Pro Asp Gin Asi 
m 310 3IS 320 

GAG ACC CAA ACG CAC 6TC AGC TFr CGG CAT ATA AOC GAG ATA ACC ATA lOOS 
. GId Ser Gin llir Aip Val Ser Pbe Arg His lie llir Gta lie fkt lie 
325 330 33S 

CTCACGGTCCAGTrGATrGTrGAGTrrGCrAAAGGTmCGAGGGITT IIIS6 
Lea nir Val Gin Leu lie Val Gla Phe Ala Lji Glj Leu Pro Ala Pbe 
340 345 350 

AU AAG ATA OX CAG GAG GAG GAG ATC ACG TPA CPA AAG GCC TGC TCG 1 1 04 
ItrLjsDeProClnGliAipCtaDe'nirLeaLealjiAlal^Ser 
355 360 365 

TCG GAG GTG ATG ATG UK C6TATG GCA CGA CCC TAT GAC CAC AGG TOG 1152 
SerGluValHelHelLeuArgUetAlaAisArgTfrAspHisSerSer 
370 375 3S0 

6ACTGA1TA TrCTTCGGG AATAATAGATGA TATAOGCGG GATTITTAC 1200 
A^ Ser Be Phe Phe Ab Am An Arg Ser 1^ Itr A rg Asp Ser 1)r 
335 390 395 400 

AAAATGGGCGGAATGCn'GATAACATrGAACACCTGCrCCATTrCTGG 1243 
Ljs Met Ala Glj He( Ala Asp Asii lie Gin Asp Leo Lea His Phe C;i 
405 410 415 

CGCCAAATGTRTCGATGAAGGTCGACAACGTCGAATACGCGCTrcrC 1296 
Aig Gin M Pbe Ser Het Iji Val Asp Am Val Glu 1^ Ala Leu Lea 
420 425 430 

ACrGCCAnCTGATCTTCTCGGACCGGCCGGGCCrGGAGAAGGCCGAA 1344 
llr Ala De Val De Phe Ser Asp Arg Fro Glj Leu Gla Iji Ala Gin 
433 440 445 

CPA GTC6AA GOi ATCGAG AGCTACTAC ATCGACAC6 CU CGCATPTAT l»2 
Len Val Gin Ala He Gin Ser lyr Tp lie Asp nr Lea Arg De lyr 
450 455 460 

ATA CrC AAC CCC CAC TGC GGC GAC TU ATG AGC CTC GTC TPC TAG GCA 1440 
He Len Am Aig His C^ Glj Asp Ser Het Ser Leu Val Pbe Tjt Ala 
463 470 475 430 



UemCrCTCGJITCCTCACCIUGCrGGGTACGnVGGCAACClGAAC t4S{ 
L}s Leu Leo Ser Ue Lea Hit Glu Leu Arg Hit Leo Glj An Gin itn 
ili 490 495 

GCCGAGATGmTrCTUCTAAAGPrCAAAAACCGCAAACrGCCCAAC 1536 
Ala Glu Hel Pbe Scr Leu Ljs Leo Asn Arg Ljs Lea Pro Lji 
500 505 510 

TrCCrCeACGAGATCTGeGACGITCATCCGATCCCGCCITCGGTCGAG IS84 
PbeLeaGloGlnDelVpA^ValBiiAlalleFroProSerValGlo 
SIS 520 525 

TCGCACmCAGATrACCCAGGAGGAGAACGAGCGTPrCGAGCGGCCr 1632 
SerHuLenCloDellirGlnGlgGlDAsnGlaArgLeuGlaAisAla 
530 S35 540 

GAGGGTAlGCGGGCATCGGITGGGGGCGCCATrACCGGCGGCATrGAT 1680 
GiD Arg Het Arg Ala Ser Val Gl; Gl; Ala De Itr Ala Gljf De Asp 
515 550 555 560 

TGCGACTCTGCCTCCAarTCGGCGGCGGCAGCCGCGGCCGtGGATGAG im 
CpA^ Ser Ala Ser Hit Ser Ala Ala Ala Ala Ala Ala Glo His Glo 
565 570 5^ 

CCrCAGGCrGAGGCGCAGOCCCAACOGmmCrGAOCCAGAACGAT 1776 
Pro GlD Pro Gin Pro Gin Pro Gin Pro Ser Ser Lei llr Glo Asa Asp 
SSO 5S5 590 

mUGCACCAGAUCAGCCGCAGCTACUCCTCACCrACCACCrCAG 1S21 
Ser Gin His Gin Ibr Gin Pro Gin Leo Gin Pro Gin Leu Pro Pro Gin 
595 600 605 

CIGCAAGGTGAACIGCAAOOCCAGCrCClACGACAGCrrCAGACGCAA 1S72 
Leu Gin Gl; Gin Leo Gin Pro Gin Leo Gin Pro Gin Leo Gin Thr Gin 
610 61S 620 

CrCGAGCCACAGATTCAACCACAGCCAGAGCTGCrrCGCGTCTCCGCr 1920 
Len (In Pro Gin De Gin Pro Gin Pro Gin Leo Leo Fro Val Ser Ala 
625 630 635 640 

OOCGTGOIXGCCTGGGTAACCCCICn'GGTTCCmTOCGCGGTCAGT 1963 
Pro Val Pro Ala Ser Val Ik Ala Pro Gljf Ser Leu Ser Ala Val Ser 
615 630 655 

ACG AGC AGCGAA TACATG GGC66A AGTGCG GCCATA GGA CCCATCACG 2016 
llirSerSerGlullfrHelGljGljSerAlaAlaDeGlrProllellir 
660 665 67D 

GCGGGtAOCAOCAGCAGTATCAOGGCrGGCGTrACCGCTAGCTCCACC 2064 
ProAhHirTlirSerSerllellirAlaAlaVainirAlaScrSerlln' 
675 630 635 

ACA TGA GC6 GTA CCG AT6 GGCAACGGA GTTGGA GTCGGTGnGGG GTG 2112 
nr Ser AlaM ProHet Gly Asn Glj Val Gl; Val Gl; Val GI;Val 
690 695 700 



68 



GGCGGCAAC6TCAGCATGTATGC6AACGCCUGAC66CGAT6GGC1TG 2160 

GlfGljAaiValSerHetiyrAlaAaiAlaGlnllirAlaHelAlaLcn 

705 71t 7iS 720 

ATG GGT GTA GCC CTG CAT TCG CAC CAA GAG CAG CTT ATC CGG GGA GTG m 
M Gij Val Ala Uu His Set Uis GId Gin Gin Len lie Gl; Gij Val 
725 710 719 

CCGG1TAAGTCGGAGCACTCGAC6ACTGCATAG 2241 
Ab Val Lji Ser Glu His Ser llr Hit Ala 
740 745 



(2)IIIIFORMA110KrORSGQIDNO:7: 

(USGQDEKCECHARACIERimCS: 

(A) 1131161^:746 amino adds . ' 

(B) TYPE: amino add 
(D) TOPOLOGY: linear 

QOHOUCIIUTYr&pniteln 

(li) SEQDEilOE DESCRIFHOII: SEt ID K0:7. 

Het Ala Pro Pro Tlir Asp Val Ser Leu Glj Asp Glu Lcn His Lea Asp 
15 10 15 

Gif Gin Asp Val Ah llel Ala llisAlaAsp AlaLen Asp Asp Fhc Asp 
20 25 30 

Ln Asp Hel LeoGlj Asp GljAsp Ser Pro GljPro CljPbeTlir Pro 
35 40 45 

His Aqi Ser Ala Fro lyr Glj Ala Len Asp Uet Ala Asp Fbe 61a Fbe 
SO 55 60 

GliGlnlMPIiellirAvAlaLenGljIleAspIlolJrGljCljLji 
65 70 75 to 

Lei Len Sir nir Ser All Ari De Ser Asn Ser He Ser Ser GIr Arg 
S5 90 95 

Asp Asp Len Ser Pro Ser Ser Ser Len Asn Gl; lyr Ser Ala AsD Glu 
100 105 110 

Ser Asp Ala Lji Ljs Ser Ljs lip Gl; Pro Ala Pro Atg Val Gin 
115 120 125 

Glu Gin Lea CfsLen Val Glj Asp Aij Ala Ser Glj lyr His tjt 
130 135 140 



Aso Ala Leu nir Glu Glj (jis 1^ 6l)f Phe Phe Arg Art Ser Val 
145 150 155 160 

11irI^SerAlaVal1^G;sQ|8ljiPlieGljArgAlaC]nGluHe( 



16$ I7D ITS 

Aq)Heliyr Hel Arg Arg L}f GjfsGIn Gla Cji ArgLca Lji Ljs(^ 
ISO iSS 190 

UaAlaVal G|f HclAisProClaC|fsVal Val Pro tin Asn Gin Dp 
I9S 200 203 

Alailetl^AisAis6laLjsLj8AlaGlnlj«GlaLj!A8pLpUel 
210 2IS 220 

llirllirSerProSerSerGlnHKGljGijfAfliGljSiirLeuAlaScr 
223 230 ' 233 240 

Glj Gl; Glj Glo Asp Fbe Val Ljs Ljs Glu lie Leo Asp Leu Hcl Hit 
243 230 233 

1^ Gig Pn Pro Gin Hit Ala Hit lie Pro Leo Leo Pro A«p Gin Oe 
260 263 270 

Lea Ala Ijs Gin Ab Arg Asa Oe Pro Ser Leo Hr 1)r Affl Glo 
27S 230 235 

• Leu Ala Val De I^Ljs Leu lie IVp IjrGlaAsp Gfrl^rGlo Glo 
290 295 300 

PnSerGlaGlnA8pLeaArj|AiglleltetSerGlnProAspGlnAfli 
305 310 315 320 

Glu Ser Glo llir Asp Val Ser Fbe Arg His lie Hit Glu lie Hit lie 
3S 330 335 

Leu Itr Val Gin Leo lie Val Glu Pbe Ala Ijs Gl; Uu Pro Ala Pte 
340 345 350 

11irL}sDeProGlaGluAq)ClaDe'nirUaLeuL|sAla(^Ser 
335 360 365 

SerGlaValHeiHe(LeaAi{lletAbArgArg1)rAspllisSerScr 
370 375 330 

AspSerllePlKPIieAlaAsoAsaAigSer1)rnrArgAspSer1)r 
335 390 395 400 

Iji He( Ala Glj Hel Ala Asp Asn De Glu Asp Leu Leo His Phe (jji 
405 410 415 

ArgGlnHelPkeSerHelLjiValAspAsnValGlaiyrAlaLcaLeu 
420 425 430 

llirAlalleValllePlieSerA^ArgProGljLeaGloLjsAlaGIa 
435 440 445 

Lea Val Glo Ala He Glo Ser 1)t 1)r lie Asp Hit Leu Arg lie Tyr 
450 435 460 



DeUo An Arg HuGfiGljJUpSer HetScr Lea Val Pbel^r Ala 
465 470 4^ m 

L}i Ud Leo Ser lie Leu llir Gin Leu Arg Hit LCD Gij AsD Gin Aa 
m 490 495 

Ala Glu Met Cp Phe Ser Leu Ljs Leu Ljs Am Arg Ljs Leo Pro Lji 
500 516 510 

Fhe Leu do Glu Ue Itp Asp Yal Hu Ala lie Pro Pro Scr Val Gin 
515 520 525 

Ser His Leu Gin De Thr Gin GiD Glu An Gin Arg Leu GId Arg Ala 
530 535 540 

GigArgHetArgAlaSerValGljrGijAlalie'nirAlaGiylleAsp 
515 550 555 560 . 

CjiAspSerAlaSerThrSerAlaAiaAlaAlaAlaAlaGlDllitGin 
565 570 575 

Pro Gin Pro Gin Pro Glo Pro Gin Pro Ser Ser Lea Itr Gin Asa Asp 

580 5S5 590 

Ser Gin His Gin Hit Gin Pro Gin Lea Gin Pro Glu Leo Pro Fro Gin 
595 600 605 



Lei Gin 61; Gin Lea Gla Pro Gb Lea Gla Pro Glu Leu Gin Itr Gli 
610 615 620 

Lea Gin Pro Gin Be GId Pro GiD Pro Gin Lei Leu Pro Val Ser Ala 
625 630 635 640 

Pro ValPro Ala SerTal Hir AlaPro GljSer LenSerAlalU Ser 
645 650 655 

UrSerSerGln^HetGljGlrSerAlaAhneGljPronellir 
660 665 670 

Pro Ala Itr Hit Ser Ser De Hit Ala Ala Val Hit Ala Ser Ser Hit 
675 6S0 6S5 

Hit Ser Ab Val Fro Hel Glj Asn Glj Val Glj Val Clj Val Glj Val 
690 695 700 . 

GljCljAsnValSerlletl^AlaAsaAlaGlnllirAlaHetAiaLeu 
705 710 715 720 

Het Glj Val Ala Un His Ser His Glu Glu Gin Leu He Glj G|f Val 
725 730 735 



Ala Val Ser Glu His Ser Itr Hit Ala 
740 745 



(aiKFOBlU'nONFORSEQDNM 



(i)SEQljEK(XaURilCIWnCS: 
(ULGKGIH: 3126 base pain 
(B)TffE;nacleicacid 
(OSmUVDEDNES&bolh 
ffl) TOPOLOGY: bolb 

(iDUOUCilUnTKcDM 



U)IUWKEl':CDS 
(6)10041101): IJ126 

(D) mm IXFOIUUIIOIII: /pro(liic(= "GM" 



(d) SEQDEh'CX DESCRmON: SEQ D) N0:& 

ATGGACTCCAiUGUTCinAACTCCrGGTAGilGiUGUAACCOCAGC ii 
M Asp Ser Ljs Gla Ser Leg nr Pro Glj Aij Ota GId Am Pro Ser 
1 5 10 IS 

AGTGTGCrrCPrClGGAGAtGGGAGATCTGATGGACmnTAAAAOC 91 
SerViILeaAlaGlnGliArsGljAtpralHeiAspPheTiTLjsllir 
» 2S 30 

mAGAGGAGGAGin'Ai;rGn;AAGGTrTn'G(xin'TCAcn;TCAcrG m 

LegArgGljGljAlallirValLjiValSerAbSerSerPraSerLHi 
35 40 45 

GCTGTCGCriUrCAATCAEACTIXAAGCAGCGlAGAGrriTGGnGAT 192 
AlaValAlaSerGloSerAspSerLjaClnArfAisLeaLeoValAv 
50 55 60 

TTrCCAAAAGGCTCAGTAAGCAATGCGGAGCAGGCAGATCTGTOCAAA 240 
Pk Pro Lji G|r SerValSer An Ab Gin Gla Pro A<p Leo Serial 
65 » 7S to 

GGAGTTTGAITCTCAATGGGAPnimATGGGAGAGACAGAAAGAAAA 2U 
AlaValSerUDSerHetGljUaiyrHe(Glj6lDl1irGta11irIjFi 
S5 90 95 

GTG ATG GGA AAT GAC CTG G6A TTC CCA GAG GAG GGC CAA ATC AGC m 336 
ValHetGljAinAspLeaGlyrheProGlnGIiiGljGlnneSerLea 
100 105 110 

TCCTCGGGGGAAAGAGACTTAAAGmiTGGAAGAAAGCATrGCAAAC 3S4 
SerSerGljGtanrAspLeuijsLeiLeaGtaGtaSerneAlaAn 
lis 120 125 

CrCAATAGGTCGACCAGTGITGGAGAGAACGCCAAGAGTTCAGGlTCC 432 
Leg Am Arg Ser nir Sff Val Pro Glo Asn Pro Ljs Ser Ser Ala Ser 
130 135 140 



ACT C(TGTG TUT GCr GCC CCC AU GAG AAG GAG TTT CCA AAA AH'CAC m 
Hit Ab Val Ser Ala Ala Pro llir Glu Ljs Glu Fhe Pro Lfs Thr His 
US 190 155 160 

IVTGATGTA mia GAA GAG CU UTTIt! AAG 6GCC4G ACTGGCAOC 52S 
Ser Asp Val Ser Ser Glu Gin Gin His Lea Ljs Glj Gin Thr Glj Thr 
165 170 173 

AACGGTGGCAATGTGAAATTGTATACCAGAGACCilAAGCACCTrrGAC 576 
Asn &j Glj Asn Val L)ib Leo T)t Thr Thr Asp Glo Ser Ttr Pbe Asp 
ISO m 190 

ATT TIC CAC GAT TTG GAG TIT TIT TN GGG TCC CU GGT AAA GAG AGG 624 
De Leo Gin Asp Leo Gin Fhe Ser Ser Gl; Ser Pro GljL]^ Glullir 
195 200 205 

AAT CAG.AGT CCT TCG AGA TU GAC CTC TO ATA GAT CAA AAC TCT TTG 672 
Asn Glu Ser Pro Trp krg Ser Asp Leu Leu lie Asp Glu Asn Leu 
210 215 220 

CnTIT(XT(TG6(SC6A6AAGACGATTC4TTC(TrTnfGAAGGAM^ 720 
Leo Ser Pro Leu Ala GIf Gin Asp Asp Ser Pbe Len Leu Gin Glf Asn 
225 230 235 240 

TOG AATGAG GACTGC AAG GGTCrCATTTTA CCG GACACTAAA CCCAAA 768 
Ser Asn Gin Asp Cp Ljs Pro Len He Leu Pro Asp Thr Ljs Pro Ljs 
245 250 255 

An AAG GAT AAT 6GA GATCTG GTT TTG TC4 AGC GOCAGT AATCTA ACA SI6 
ne L}s Asp Asfl Glj A^ Len Val Leu Ser Ser Pro Ser Asn Val Tlir 
260 265 270 

ntaXCAAGTEAAAACAGAAAAAGAAGATTTCATCGAACTCTGCACC m 
Len Pro Gin Val Lji llir Glu Ljs Glu Asp Pbe lie Gin Leu Hir 
275 2N 285 

0CTC6G 6TA ATT AAG CAA GAG AAA CTG GGG AGl CTTTACTCTCtG GCA 912 
Pro Glj Val De Ljs Gin Gin Ljs Leu Glj Ttir Val 1^ Cp Gin Ala 

290 295 300 

AGG ITT GCT G6A GCA AAT ATA ATT GGT AAT AAA AT6 TITT GGG ATT TCT 960 
Ser Phe Pro Glj Ala Aso lie lie Glj Asn Ljs Uei Ser Ala lie Ser 
305 310 315 320 

GTTaTGGTGTCACTACCTITGGAGGAGAGATGTAGCIGTATGACATG 1008 
Val His Glj Val Ser Tbr Ser Gtj Glj Gin Met 1)t His T|r Asp llel 
325 330 335 

AATACAGUTCCCnTlTCAACAGCAGGATGAGAAGOGrATrnTAAT 10S6 
Asn llir Ala Ser Len Ser Gin Gin Gin Asp Gin Ljs Pro He Phe Asn 
340 345 350 

GTCAnCCACCAArrCCCGnGGTTCCCAAAATTGCAATACGTCCCAA 1101 
Val He Pro Pro He Pro Val Glj Ser Gin Asn Trp Asn Arg (^fs Gin 
355 360 365 



CCA TCr CCA CAT CAC AACTTC ACT TCT CTC GCC ACT CTC AAC TO GCT 1 1S2 
Clj Ser Clj Atp Asp Asn Leu Hit Ser Leu Glj llir Leu Asn Fhe Pro 
370 37S m 

GGTCGA AQ GirTITTCr AAT6GCTATTCA AGCCOCAGC ATG AGA CCA 1200 
GljArsllirVal PheSer Aa Gljljr ScrScr Pro Ser Net ArgPro 
3SS 390 395 400 

GATGTAA&CTCTCCTGGATDCAGCTCCTCAAGAGCAACAACAGGACCA i24S 
Asp falser Ser Pro Pro Ser Ser Ser Ser Hit Ala ThrllirGlr Pro 
409 410 415 

CCTOCCAGCGGCCGCGTGCAAGAGGAGPrGTCCPrGGTrTCCCCCGAC 1296 
Pro Pro Ser Glj Ai| Val Gig Glo Cla LeiC)i Lea Val Glj Atp 
420 425 430 

ACG GCC TCC G6C TAG CAC TAC AAC CGC GTC ACC TGT GGA TCC TGC AAG 1344 
Ais Ala Ser Clj llfr His Tjfr Asn Ala Leu Hit C|)[8 Gljr Ser Cp L}s 
435. 440 445 

GTGTOTTTGGACGCAGCGITAOGAAGAGCGOCGTCTACTGCTGCAAG 1392 
Val Fhe Phe Arg Arg Ser Val nir hp Ser Ala Val 1)t Cp C)fs Iji 
450 455 460 

m COG CGC GCC TGC CAA ATC CAC ATG TAC AT6 ACG CCA AAG TGT GAG 1440 
Pbe Gl; Arg Ala Cys Clu Net Asp Het 1)t He( Arg Arg L}8 Glo 
465 470 475 430 

GAGTGCCGCCTGAAAAAGTGCCrGGOCGTGGCTATGGGGCCGGAATGC 1433 
Glu Cfi Arg Leu Lj8 Ljs Cys Leu Ala Val Glj llel Arg Pro Gin Qs 
4SS 490 495 

GTCGTCGCGGAGAACCAAT6TGCGATGAAGCGGCGCGAAAAGAAGCCC 1536 
ValTalProCloAsneinCpAlaHetLjiArgArgGluLjsLjsAb 
500 505 510 

CAGAAGGAGAAGGACAAAATGAGCACrTCGOGGAGCTCTGAGGATCGC 1534 
GId Ljs Glo Ljs Asp Lji He( nr Tbr Ser Pro Ser Ser Gin Dis Clj 
515 520 

GGCAATCGCAGCTrGGCCTCrCCTGGCGCCCAAGACTITGTTAAGAAG 1632 
Glj Am Glj Ser Lm Ala Ser Glj Glj Glj Gin Asp Phe Val Ljs Ljs 
530 535 510 

CAGATrmCACCiTATGACATGCCACCCGGCCCAGGATGGCACTATr 1630 
Gin De Leu Asp Lea Hel Hit C|s Gin Fro Pro Gil His Ab Hit ne 
545 550 555 560 

CCGCTAmOCTGATCAAATATrGGCCAAGTCTCAACCGCGCAATATA 1723 
Pro Leu Lea Pro Alp Gin ne Leu Ala Ljs Gin Ala Aig Asn Oe 
565 Sn 57S 

OCT TCC TTA ACG TAC AAT CAC ITG GCC GH ATA TAC AAC TTA ATTTGC 1776 
Pro Ser Len Hit 1]fr Aso Gin Leu Ala Val lie 1^ Ljs Leu lie Itp 
530 535 590 



m m GAT GGC TAT GAG CAC CCA TCT GAA GAG GAT CTC A6G CGT ATA 1824 
ly GId Asp Glj 1)T Gla 6lo Pro Scr 6Id GId Aqi Leu Arg Ar| lie 
59S 600 605 

ATGAGTCAACOCGATGAGAACGAGAGCCtAACGCACimiAGCTITOGG IS72 
Hel Ser Gin Pro Asp Glu Asn Glu Scr Gin Tlr Asp Val Scr Pbe AiJ 

610 615 620 

CAT ATA A0C6AG ATA AOC ATA CTCACG GTCCAG TTG ATrGTTGAG TFT 1920 
His De Thr Glu lie nr lie Leo Thr Val Gin Lea lie Val Gin Fhe 
623 630 63S MO 

GGT AAA G6T CPA CCA GCG TIT ACA AA6 ATA CCC CAG GAG GACGAG ATC I96S 
Ala 1^ Glj Leo Pro Ala Pbe llr Ljs lie Pro Gin G|g Asp Gin He 
615 650 655 

AGG TTA CTA AAG GCC TGC ICG TCC GAG GTG ATG ATG CPG CCTATG GCA 2016 
11irLeaLeuLjsAlaC]sSerSer6lnValUe(IIc(LeaArgUe(Ala 
660 665 67D 

CGAOCCTATGACaCAGCTCGCACTGAAnmiTCGCGAATAATAGA 2064 
Arg Arg Tjr Asp His Ser Ser Asp Ser ne Phe Plic Ala Asn Aai Arg 
6^ 6S0 6S5 

TCA TATACG GGG GATTPTTAC AAA ATG GCCGGA ATG GCfGATAACATT 2112 
SerlJrThrArgAspSerT^LjsHelAlaGljHetAlaAspAsnDe 
690 .695 m 

GAAGACCrCCPGCATTICTGCCGCCAAATGTrCTCGATGAAGGTGGAC 2160 

GlnAspUnLeallsFbeCjiArgGlnUetPiRSerHelljiValAsp 

705 710 715 720 

AACCTCGUTACGCGCrrCTCACrGGCATrGTGATCTTCTCGGACGGG 220S 
AmValGlnlJrAbLenLeullirAlaDeValllePlKSerAspArg 
725 730 735 

COGGGGPRGACiUGGCCGAACrAnCGUGCGAICGIGAGCTACTAC 2256 
ProGlrLenGlnl^AlaGlnLeDValGloAlaneGlnSeriyriyr 
740 745 750 

ATC GAC ACC CTA CGC ATT TAT ATA GTC AAC C6C CAC TGG GGC GAC TCA 2304 
He Asp Hit Len Arg Ue Tfr He Leo Asn Arg His C^ Gij Asp Ser 
755 760 765 

ATCAGCCrCGTCTTCTACGCAAAGCrCCrCTCGATCCTCACCGAGnt; 2352 
Het Ser Leu Val Plie Tfr Ala Ljs Leu Len Ser He Len llr Gin Len 
770 7ti 7S0 

CGTACGPKGGCAACCAGAACGCCGAGATGTGTTrCTGACTAAAGCTC 2400 

ArgllirLeaGljAaGlnilsnAlaGlnlletCjiFlieSerLeoLjsLeo 

735 790 795 too 

AAAAACCGCAAACTGCCCAAGmCTCGAGGAGATCTGGGACGITCAT 2443 
Lja Asa Atg Lji Len Pro Ljs Fhe Len Glu Glu He ttf Asp Val His 
305 310 315 . 



75 



IHXi1i;GCGCC41CGGTCaG1CGC4C(TrGil6A'rrACCC4GCA6UG 2496 
Ala ne Fro Pro SerValGloSer His Lea Clone Hit GlnGlo Gin 

m m m 

AAC GAG CGT CTCCAG C6G GCTGAG OGTATG GGG GCA TCG GIT GGG GGC 25i4 
Am Gin Ar; Leu Gli Arg Ala Gin Aig llel Arg Ab Scr Vai GljGlr 
S3S S40 845 

GGCATrACCGGCGGCATrGATTGCGAGTin'GOCTCCACrTCGGCGGCG 2S92 
Ala ne Hit Ala Glj lie Aspics Asp Ser Ala Sernr Scr Ala Ah 
190 8SS StO 

GCAGCCG06GGCCAGCATGAG0GrGAG(CTGAGCCCaGG0CGAA0CG 2610 
Ala Ah Ah Ah Gh Hh Gta Pro Gta Pro Cta Pro Gin Pro Gin Pro 
S65 870 tH m 

TCCTCCCTGAOCUGAAGGATTCCGAGGACCAGACAGAGCCGGAGn'A 26U 
Scr Ser Ui nr Gin An Aip Scr Gh His Gh Hv Gin Pro Gh leu 
ns S90 S9S 

CAAOCrCAGCrAOCACCrCAGCRCAACGTCAACTGliUGOCGAGCn; 2736 
Gh Pro Gh Len Pro Pro Gh Lei Gh GIf Gh Lei Gh Pro Gh Leo 
900 90S 910 

CAAOCACAGCrrCAGACGCAACTCCAGOCAGAGATrCAACCACAGCCA 27X4 
Gh Pro Gh Len Gh Hit Gh Leo Gh Pro Gh Ite Gh Pro Gh Pro 
91S 920 925 

CiGnCirrHXGTCTCCGCTCCCGTGCCCGOCTCCGTAAOCGCAOin' 2832 
Gh Len Len Pro Vai Ser Ala Pro Yal Pro Ab Ser Val nr Ab Pro 
930 935 m 

GGT TCG TTG TCG GGG GTC AGT ACG AGC AGC GAA TAG ATG GGC GGA ACT 2880 

GI]rSerLeiSerAbValSer11irSerSer6h1)rHel6(fGlfSer 

945 950 955 960 

GGC GCGATA GGA CCCATC ACG CCG GCA ACG ACC AGCAGTATCAC6 GCT 2928 
AbAbDcGlfProIlellirProAbllir'nirSerSernellirAb 
965 970 975 

GGCGnACCGCrAGCTOCACCACATCAGGGGTACCGATGGGCAACGGA 2976 
AbralThribSerSerllir'nirSerAhValProHetGlfAsnGlj 
980 985 990 

GTT GGA GTC GGT GTT GGG GTG GGC GGC AAC GTC AGC ATG TAT GCG AAC 3024 
Val GljVal Gl; Val GljVal GljGljAsn Val Ser HdV Ab Aso 
995 1000 1005 

GCG GAG ACG GOG ATG GCG TTG ATG GGT GTA GCG CTG CAT TCG GAG CAA 3072 
Ab Gh Hit Ah Het Ala Len Hel GIf Val Ab Len Bis Ser His Gh 
lOiO 1015 1020 

GAG CAG err ATG GGG GGA GTG GCG GTT AAG TCG GAG CACTDG ACG AIT 3120 
Gh Gh Len He Glj Glj Val Ab Val Ljt Scr Glu His Scr Hit Hit 
1025 1030 1035 1040 



EG4m 
Ala 



3126 



(2)DIFt)BlU1I0IIF0RS»|IDK0:9: 

(i)SEQMCE(aARA(Mini(S: 
<A)I£i61H:l01ianuiHacids 
ODIVPE: amino add 
(D) TOPOLOGY: finear 

(iOHOimsnTE protein 

(iDSEQlMCE DESOUmOlV: SEQ D K0:9: 

Mksf Ser Ljs Gin Ser Leo Tlir Pro Glj Arg Gio Gin Am Pro Ser 
15 10 15 

Ser ValLen Ala Gin Gin Ai^Gly Asp Val Met Asp Pile l^rLjiTlir 
20 25 30 

UoArgClfGljAlallirVall^ValSerAlaSerSerProSerLco 
■ 35 40 45 

Ala fal Ala Ser Gin Ser Asp Ser Ljs Gin Arg Arg Leo Leo Val Asp 
50 55 60 

Phe Prolyl 61; SerVal Ser AsoAbGlnCIn Pro Asp Len Ser Ijs 
65 70 75 (0 

Ala Val Ser Leo Ser Hel GljLeu llelGi; Gli Hit Gli llirl^ 
SS 90 95 

ValHetCljrAnA^LeaGdfPbeProGlnCloGljClnlleScrLeg 
100 105 no 

Ser Ser Glj Gin Itr Asp Leu LjsLenLenGloGIo Ser DeAla Asa 
115 120 125 

LenAsn ArgSer nr Ser Vai Pro Gin AsnPro LjiSer Ser AiaScr 
130 135 140 

Itr Ala Vai Ser Ala Ala Pro Hit Gin Ljs Gin Phe Pro Lji Hit His 
145 ISO 155 160 

Ser Asp Val Ser Ser Gin Gin Gin His Len Lji Gij Gin Hit GIj Hit 
165 170 175 

AsD 61; Gl; An Val Ljs Leu lyr Ik Hit Asp Gin Ser nr Pbe Asp 
ISO 1S5 190 

De Leo Gin Aqi Len 6ln PheSer Ser GI;Ser Pro 61; L;i 6ln nir 
195 200 216 



Aso Gla Ser Pro IVp Arg Ser Asp Lcn Lea De Asp Gin Aai C]fi Lea 
210 i\i 220 

Leo Ser Pro Lea Ala tij Gla Asp Asp Ser Fhe Leo Lea 61a Glj Asn 
225 230 235 240 

Ser AsD Clu Asp C|« Lji Pro Leo lie Lea Fro Asp Hit L}i Pro Ljs 
245 250 255 

lie Ljs Asp Asa Glj Asp Lea Val Lea Ser Ser Pro Ser Asn Val Tbt 
260 285 270 

Lea Pro Gla Val L}i Hv GId Ljt Gin Asp Fte De Gla Len Tlr 
27S 210 2IS 

Pro G|f Tal De L]i Glo Gla Iji Lea 6lj nr Val 1^ (Id Ala 
290 295 300 

Ser Pbe Fro Cpa AsB lie ne Glj Asn Lji Het Ser Ala ne Ser 
305 310 315 SO 

Val His GIf Val Ser Tlir Ser Glj Clj Gla Hel 1^ His IJfr Asp M 
325 330 335 

Asa Hit Ala Ser Leo Ser GId CId Gin Asp Gla Ljfs Pro He Phc Asa 
340 345 350 

Val He Pro Pro He Pro Val Gl; Ser Gla Asa Itp Asa ArgCfs GId 
355 360 355 

Glj Ser Gij Asp Asp Asa Leo llir Ser Lea Clj Hit Lea Asa Pbe Fro 
370 375 3S0 

Glj Arg nir Val Fhe Ser AsD G|r 1^ Ser Ser Pro Ser Met Arg Fro 
335 390 395 400 

AspValSerSerProPnSerSerSerSerlhrAlallu'llrGljPro 
405 410 415 

Pro Pro Ser G|j Arg Val GId Gla GId Leo Leo Val (!rs G|f Asp 
420 425 430 



Arg Ala Ser Glj 1^ His IjT Asa Ala Lea nir G{8 6|f Ser Cjs Lji 
435 440 445 

Val Phe Phe Arg Arg Ser Val Hv Ljs Ser Ala Val lyr C}i (Irs Ljs 
450 455 460 

Pbe Glj Arg Ala Cjs Gla Het Asp Het l^r Hel Arg Arg Ljs Cfs Gla 
465 470 475 4S0 

GlDCjiAigLeDLjsl^CjsLeoAlaValGljlIelArgPnGlDGjs 
435 490 495 



Tal Val Pro 6li An On Cjsila Met Ljslrg AtsGIn L}i Ljs Ala 
500 SOS 510 

GId L}8 Glu Lj8 Asp Ljs He( Thr Hit Ser Pro Ser Scr GId His Glj 
SIS S20 52S 

GIf AsD GljfSer Leo Ala Ser Glj Glj Glj Glo Asp Phe Val Ljiljfs 
530 SIS 540 

GId De Leu Asp Leo Het llir Cjs Glo Pro Pro GId HU Ala llir lie 
545 550 555 560 

Fro Leu Leo Fro Asp GId lie Lea Ala Ljs CjfK Gin Ala Arg Asn lie 
565 570 m 

Pro SerLeo nr Tfr Asd Gh Leo Ala Val He hp Leu lie Trp 
SSO 585 590 

1)r6lDA8pGlf1)T(anGlnProSerGlDGIaAspLeoAisAii!lle 
595 600 60S 

Het Ser Gin Pro Asp Glu Asd GId Ser Glo llir Asp Val Scr Pke Arg 
610 615 620 

HisDenr GlaDeltr De Leonv Val CliLeufle Val GId Phe 
625 630 635 640 

Ala L}B Glj Leu Pro Ala Phe llir Ljs He Pro Gin GId Asp Gin De 
649 650 633 

nr Leo Leu Ala Cfs Ser Ser GId Val Het Hel Leu Arg Ue( Ala 
660 665 670 



Arg Ais lyr Asp His Ser Ser Asp Ser De Phe Phe Ala Asn Asn Aig 
675 630 635 

Ser Ijr Hit Arg Asp Ser 1)t Ljs Het Ala Glj Hel Ala Asp Asn De 
690 695 700 

6lg Asp Leo Len His Phe Cjs Arg Gin Hel Phe Stf Hel Ljs Val A^ 
705 710 715 720 

Asd Val GInTjr Ala Leu Leu Hir Ala lie Val lie Phe Ser Asp Arg 
725 730 735 

Pro GljLeo GId Ab GIb Len Val Glo Ala DeGIn Ser Tfr 1)r 
740 745 m 

He Asp Thr Leu Arg He 1^ lie Leu Asn Arg His-(^ Glj Asp Ser 
733 760 763 

Hel Ser Leu Val Phe 1)t Ab Ljs Leo Len Ser De Leo Thr Glu Leu 
770 775 730 



ArgnrLenGljAnGloiaiAiaClaHetGiiPiKSerLeoLjiUn 
7S5 790 79S MO 

Ljs Asn Arg Ljs Lea Pro LjsPhe Leo Clu Gin De IVp Asp Val i\t 
(OS 810 (15 

Ala Oe Pro Pro Ser Val Gin Ser Du Lea Gla lie Hit Gin Glo Gin 
820 825 830 

AsoGlaAiiLeaGloAtsAlaGlDArgUctAreAlaSerYalGljGlj 
83S 140 849 

AlaDeTlirilaGljIleAvQsAipSerAlaSerllirSerAlaAli 
850 855 8(0 

Ala Ala Ala Ala Gil lb GiD Pro Gin Pro Glo Pro Glo Pro Gla Pro 
865 870 875 880 

Ser Ser Leo Itr Gin Am Asp Ser Gin HU Gin Hr Glo Pro Gin Lea 
885 890 895 

Gin Pro Gin Lea Pro Pro Gla Lea Cla GIf Gin Lea Gin Pro Gin Leo 
900 905 910 

Gla Pro Gin Len Gin Hit Gin Len Gin Pro Gin He Gla Pro Gla Pro 
915 920 925 

Gla Lea Len Pro Val Ser Ala Pro Val Pro Ala Ser Val llir Ala Pro 
910 935 910 

Glj Ser Len Ser Ala Val Ser Hit Ser Ser Glo 1)r Uel Glj Gl; Ser 
»5 950 955 960 

AlaAlaDeGlfProilellirProAlanir'nirSerSerllellirAla 
965 970 975 

AlahlltrAbSerSerllrllirSerAlahlPnHetGlfAsnGlf 
980 985 990 

Val Glj Val Gljr Val Glj Val Gly GljAim ValSer Het1)T Ab Ata 
995 1000 1005 

Ala Gin Ibr Ala Hel Ala Leo Net Glj Val Ala Lea Hb Ser Bb Gin 
1010 lOIS 1020 

Gin Gin LenDe Glj Glj Val Ala Valljs Ser Gin Bis Ser Tbr Hv 
1025 1030 1035 1010 



(2)DiFORHA110NFORSEQIDNO:IO: 



(DSEQDEKIXCIURAnmSinS: 
WLEKCm 13 base pain 
(B)nTE:niideicadd 
(OSmiQINES&bolh 
(D)TOri)LOG¥:bolb 

'(iOM0imE'nTE:DIU(genoiiiic) 



U)NiUIE/KEV:niiscJeature 
(B)UI&(TI0lli:7 

(D) 01HER INFORIU'nOII: /fnivd= IMiM Ecdjwne 
Response Qemeiir 

/iNte= "N al position 7 is 0 up lo 5 imdeolides; 
viUi I nndeoGde being espcdall; preferred." 



Cd) SEQUEMX DESGUPnON: SEt ID KlhIO: 

RGBNNHNTGNK'Cr 13 

(2)IIIIF0RlUnOil[F0BSEQIDNO:ll: 

G)SEQUEKCECBARACTERimCS: 
(UmtCTB: 13 base pairs 
(B)'nTE:niideicacid 

(osimmmboii 

(D)TOPOU)Gr:botli 
tii)H0LECllU11TE:Dlll4(geiioniiG) 



(it)raiME: 
U)NUIE/KEr:Bl8c [ealore 
(B)L0arail:7 

(D) OMR DIFORmnON: /product- "llodined Ecdjsone 
Response Bement" 

/oote= "N at position 7 can be 0 np lo 5 
DodeoUdes, lilh I nndeolide being preferrei'' 



(iDSEIHIEKtXDESGRIPIIDItSEIIIDKO:!!: 

RGKKCUiKinilVn' 13 

(2)IIIIFORlU'n01IIFORSEQIDNO:l2: 

WSEQieCXCIURACmtmiCS: 
(4)Wnil3ba8epair8 
(BlTlTEnndeicadd 
(OSmiOIEDNES&bolb 
(D)nPOLDGr:bolb 



(iDHOlEOILE'nP&DlU (genomic) 



(iDSEIIUEIICEDESIlliniON: SEQID K0:I2: 
ACTGONTCTTCr 13 
(SINFIIBlUIIONFORSEQIDNOai 

u)SEQiMCECHmaimn(S: 

OUUSIGm 13 base pain 
(B)nTE:nideicatid 
(OSmUKDEDlUmbofli 
«) TOPOLOGY: bolb 

(iDHOLECIII£nrE:DlU(seiioinc) 



(iOFEiniRK 
tt)IUIIE/KEy:iiia;_[eatiire 
(B)LOC4TI0N:7 

<D) OMR niFORiUnOlV./proiliici^ "Ecdjsone Response 
Hemenr 

/iH)te= "N a( position 7 can be 0 op (o S 
lodeoOdet; irilh 3 nideoiides being preferred." 



bO SEUMCE DESCBirnOft SEQ D N0:13: 

RGSmRGBM 13 

(2)IlliF0B)U1101IF0RSEQIDN0:14: 

(DSEQIMGEIMUnERISinS: 
(DLfflCni: 49 base pain 
(B)Tn>fiinRlelc»id 
(OSnUNDQUIMbolb 
(D) TOPOLOGY: both 

(iDHOUGllLEnTKDlU(|eaoniiG) 



(lDmnSI(CEIIESCRlP110N:SEQIDR0:U: 

TACUCGCCC TUHTGTGG ATGCTGOUG CTGmcm CGAGGC4GC 

.(2)DIF0RlU1I0KF0RSEQIDill0:t5: 

(i) SEQUENCE OURACMISUCS: 
CUWIH: S3 base pain 
(B)Tn>E:aiK!eicacid 
(OSmNDEDNES&bolt 
(D) TOPOLOGY: both 



Qi) MOLECULE nPE: DM (genomic) 



(tO SEUIOHS DESOUPHOIII: SEQ D HO: IS: 

GTACTCCCGG G6CGGGGCTA T6C6GG6CGG GGR'AATCGC TAGGCGCGGG Wk 

(2)DiF0BlU1I0NniRSEIlIDN0:l6: 

(DSEQlMCEQimCIWnCS: 
WU^'GIV: S3 base pain 
(B)nTE:nDdeicatld 
(OmANDEDhmbotli 
(D) TOPOLOGY: bolli 

mmmMfmrniimnii) 



(li) SEQISKIX DESOUPHOIIt SEQ ID 1(0:16: 
GTAHIGCCCC GGCGCMGCGiTnGOCCCC OXCGUTAG GCCCGCCCCG GGA 



(2)DIFOR1U110IIIFORSGIIIDNO:I^ 

(i)SEQMCEGIimCITOIHS: 
U)I£KG1H: 34 base pain 
(B) TYPE: nudeic add 
(OSmUWEDKES&boih 
(D)TOFOLOGIf:boUi 

QDHOIjm£TI[PE:DIU(geMnyG) 



Od) SEQIMCE DESOUFTIOIII: SEQ ID N0:17: 
AGimiCGJlCUG1t!C4TTGTTinTr6CTGAA 34 
(2)INFOR1U1IONFORSEIID)KO:|{: 

(D SEQ1MZ ClURAPIfllfilllS: 

(A) LB(G'ni:35basepain 

(B) TtTE:nudeicacid 
(OSmlKDED^'mboib 
(D)TOrOLOGlf:boUl 

Qi)HOimEnTG:DIU(geiniiiic) 



(li)SEQIiDIIZDGSQUP1IOII:SEIllDNII:l{: 
jtCnmiGCAAIUGiUCU TGUCIT6TC (l4TCt 



